Audio coding



FIELD OF THE INVENTION 

The present invention relates to coding and decoding audio signals. 

BACKGROUND OF THE INVENTION 

Referring now to Figure 1, a parametric coding scheme, in particular a
sinusoidal coder, is described in US Published Application No. 2001/0032087A1. In this
coder, an input audio signal x(t) received from a channel 10 is split into several (overlapping)
segments or frames, typically of length 20 ms. Each segment is decomposed into transient
(Ct), sinusoidal (Cs) and noise (Cn) components. (It is also possible to derive other
components of the input audio signal such as harmonic complexes, although these are not
relevant for the purposes of the present invention.)

The first stage of the coder comprises a transient coder 11 including a transient
detector (TD) 110, a transient analyzer (TA) 111 and a transient synthesizer (TS) 112. The
detector 110 estimates if there is a transient signal component and its position. This
information is fed to the transient analyzer 111. If the position of a transient signal
component is determined, the transient analyzer 111 tries to extract (the main part of) the
transient signal component. It matches a shape function to a signal segment preferably
starting at an estimated start position, and determines content underneath the shape function,
by employing for example a (small) number of sinusoidal components. This information is
contained in the transient code Ct.

The transient code Ct is furnished to the transient synthesizer 112. The
synthesized transient signal component is subtracted from the input signal x(t) in subtractor
16, resulting in a signal x2.

The signal x2 is furnished to a sinusoidal coder 13 where it is analyzed in a
sinusoidal analyzer (SA) 130, which determines the (deterministic) sinusoidal components.
The end result of sinusoidal coding is a sinusoidal code Cs, and a more detailed example
illustrating the conventional generation of an exemplary sinusoidal code Cs is provided in
PCT patent application No. WO00/79519A1.

From the sinusoidal code Cs generated with the sinusoidal coder, the
sinusoidal signal component is reconstructed by a sinusoidal synthesizer (SS) 131. This
signal is subtracted in subtractor 17 from the input x2 to the sinusoidal coder 13, resulting in a
remaining signal x3 devoid of (large) transient signal components and (main) deterministic
sinusoidal components.

The remaining signal x3 is assumed to mainly comprise noise, and a noise
analyzer 14 produces the noise code Cn representative of this noise, as described in, for
example, PCT patent application No. WO01/89086A1.

Figures 2(a) and (b) show generally the form of an encoder (NE) suitable for
use as the noise analyzer 14 of Figure 1 and a corresponding decoder (ND) for use as the
noise synthesizer 33 of Figure 6 (described later). A first audio signal r1, corresponding to the
residual x3 of Figure 1, enters the noise encoder comprising a first linear prediction (SE)
stage which spectrally flattens the signal and produces prediction coefficients (Ps) of a given
order. More generally, a Laguerre filter can be used to provide frequency sensitive flattening
of the signal as disclosed in E.G.P. Schuijers, A.W.J. Oomen, A.C. den Brinker and A.J.
Gerrits, "Advances in parametric coding for high-quality audio", Proc. 1st IEEE Benelux
Workshop on Model based Processing and Coding of Audio (MPCA-2002), Leuven,
Belgium, 15 November 2002, pp. 73-79. The residual r2 enters a temporal envelope estimator
(TE) producing a set of parameters Pt and, possibly, a temporally flattened residual r3. The
parameters Pt can be a set of gains describing the temporal envelope. Alternatively, they may
be parameters derived from Linear Prediction in the frequency domain such as Line Spectral
Pairs (LSPs) or Line Spectral Frequencies (LSFs), describing a normalised temporal
envelope, together with a gain envelope.
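
By way of illustration only, the spectral flattening performed by the linear prediction stage (SE) can be sketched as follows. The sketch assumes the autocorrelation method with a Levinson-Durbin recursion, which is one common way of obtaining prediction coefficients and is not stated to be the method used in the cited coders. The function names and the AR(1) test signal are hypothetical.

```python
import random

def ar1_noise(n, coeff, seed=0):
    """Hypothetical test signal: white noise coloured by a one-pole filter."""
    rng = random.Random(seed)
    x, prev = [], 0.0
    for _ in range(n):
        prev = coeff * prev + rng.gauss(0.0, 1.0)
        x.append(prev)
    return x

def autocorrelation(x, max_lag):
    """Autocorrelation values r[0..max_lag] of a list of samples."""
    n = len(x)
    return [sum(x[i] * x[i + k] for i in range(n - k)) for k in range(max_lag + 1)]

def levinson_durbin(r, order):
    """Return (a, err), where a[0] == 1 and A(z) = sum_k a[k] z^-k
    is the prediction-error (flattening) filter."""
    a = [1.0] + [0.0] * order
    err = r[0]
    for m in range(1, order + 1):
        acc = r[m] + sum(a[k] * r[m - k] for k in range(1, m))
        k_m = -acc / err              # reflection coefficient of stage m
        prev = a[:]
        for k in range(1, m):
            a[k] = prev[k] + k_m * prev[m - k]
        a[m] = k_m
        err *= (1.0 - k_m * k_m)      # remaining prediction error energy
    return a, err

def flatten(x, a):
    """Apply the FIR prediction-error filter A(z) to x, giving the residual."""
    p = len(a) - 1
    return [sum(a[k] * x[n - k] for k in range(min(p, n) + 1)) for n in range(len(x))]
```

In this sketch the coefficients `a` play the role of the parameters Ps and the output of `flatten` plays the role of the spectrally flattened residual r2.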

In the parametric decoder (ND), a synthetic white noise sequence is generated
(in WNG) resulting in a signal r3' with a temporally and spectrally flat envelope. A temporal
envelope generator (TEG) adds the temporal envelope on the basis of the received, quantised
parameters Pt' and a spectral envelope generator (SEG, a time-varying filter) adds the
spectral envelope on the basis of the received, quantised parameters Ps' resulting in a noise
signal r1' corresponding to signal yn of Figure 6.
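
The decoder chain WNG, TEG, SEG described above can be sketched as follows. This is a minimal sketch under stated assumptions: the temporal envelope is taken to be a list of piecewise-constant gains and the spectral envelope an all-pole filter 1/A(z). The function name and parameter layout are hypothetical, and the frame length is assumed to be at least the number of gains.

```python
import random

def synthesize_noise(n, gains, a, seed=0):
    """White noise (WNG) -> temporal gains (TEG) -> all-pole filter 1/A(z) (SEG).

    gains: piecewise-constant temporal envelope (assumed form of Pt').
    a:     prediction-error filter coefficients with a[0] == 1 (assumed form of Ps').
    """
    rng = random.Random(seed)
    white = [rng.gauss(0.0, 1.0) for _ in range(n)]
    hop = n // len(gains)  # samples covered by each gain value
    shaped = [white[i] * gains[min(i // hop, len(gains) - 1)] for i in range(n)]
    out = []
    for i, v in enumerate(shaped):
        # Recursive (all-pole) filtering re-imposes the spectral envelope.
        fb = sum(a[k] * out[i - k] for k in range(1, len(a)) if i - k >= 0)
        out.append(v - fb)
    return out
```

For example, `synthesize_noise(100, [0.0, 1.0], [1.0])` yields silence in the first half of the frame and white noise in the second half.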

In a multiplexer 15, an audio stream AS is constituted which includes the
codes Ct, Cs and Cn.

The sinusoidal coder 13 and noise analyzer 14 are used for all or most of the
segments and amount to the largest part of the bit rate budget.

It is well known that parametric audio coders can give a fair to good quality at
relatively low bit rates, for example 20 kbit/s. However, at higher bit rates the quality increase
as a function of increasing bit rate is rather low. Thus, an excessive bit rate is needed to
obtain excellent or transparent quality. It is therefore difficult to attain transparency using
parametric coding at bit rates comparable to those of, for example, waveform coders. This
means that it is difficult to construct parametric audio coders having an excellent to
transparent quality without an excessive usage of bit budget.

The reason for the fundamental difficulty in parametric coding reaching
transparency is in the objects that are defined. The parametric coder is very efficient in
encoding tonal components (sinusoids) and noisy components (noise coder). However, in real
audio, a lot of signal components fall into a grey area: they can neither be modelled
accurately by noise nor can they be modelled as (a small number of) sinusoids. Therefore, the
very definition of objects in a parametric audio coder, though very beneficial from a bit rate
point of view for medium quality levels, is the bottleneck in reaching excellent or transparent
quality levels.

At the same time, traditional audio coders (sub-band and transform) give
excellent to transparent coding quality at certain bit rates, typically in the order of 80-130
kbit/s for stereo signals sampled at 44.1 kHz. Combinations of transform and parametric
coders (so-called hybrid coders) have been proposed, for example as disclosed in European
patent application no. 02077032.7 filed on May 24, 2002 (Attorney Docket No. ID 609811
/PHNL020478). Here spectro-temporal intervals of an audio signal, which would otherwise
be sub-band coded, are selectively coded with noise parameters in an attempt to reduce bit
rate while maintaining audio quality.

Alternatively, a transform or sub-band coder might be cascaded with a
parametric coder of the type shown in Figure 1. However, the expected coding gain for such
an arrangement, where the parametric coder is preceding the transform or sub-band coder, is
minimal. This is because the perceptually most important regions of the audio signal would be
captured by the sinusoidal coder, leaving little possibility for coding gain in the
transform/sub-band coder.

Audio coders using spectral flattening and residual signal modelling using a
small number of bits per sample are disclosed in A. Härmä and U.K. Laine, "Warped low-
delay CELP for wide-band audio coding", Proc. AES 17th Int. Conf.: High Quality Audio
Coding, pages 207-215, Florence, Italy, 2-5 Sep. 1999; S. Singhal, "High quality audio
coding using multi-pulse LPC", Proc. 1990 Int. Conf. Acoustic Speech Signal Process.
(ICASSP90), pages 1101-1104, Atlanta GA, 1990, IEEE Piscataway, NJ; and X. Lin, "High
quality audio coding using analysis-by-synthesis technique", Proc. 1991 Int. Conf. Acoustic
Speech Signal Process. (ICASSP91), pages 3617-3620, Atlanta GA, 1991, IEEE Piscataway,
NJ. In a number of studies, it has been shown that this coding strategy enables an excellent to
transparent quality at bit rates corresponding to 2 bit/sample for mono signals (88.2 kbit/s for
44.1 kHz audio). In that respect, they do not exceed the performance of sub-band or
transform coders.
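
The figure of 88.2 kbit/s follows directly from the stated 2 bit/sample at a 44.1 kHz sampling rate. The arithmetic can be checked with the following helper, whose name is hypothetical:

```python
def bit_rate_kbps(bits_per_sample, sample_rate_hz, channels=1):
    """Bit rate in kbit/s for a given number of bits spent per sample."""
    return bits_per_sample * sample_rate_hz * channels / 1000.0
```

Here `bit_rate_kbps(2, 44100)` gives the mono figure quoted above. The same expenditure per sample on two channels would double it.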

It is an object of the present invention to provide a parametric audio coder
whose bit rate is controllable across a range and which provides high quality levels at a bit 
rate comparable with traditional coders. 

DISCLOSURE OF THE INVENTION 

According to the present invention, there is provided a method according to
claim 1.

The invention provides scalability in a parametric coder, by supplementing the
noise coder with a pulse train coder. This provides a large range of bit rate operating points
and merges the two strategies into one coder without introducing a large overhead in
complexity.

The coding strategies within the noise coder are complementary in terms of
strengths and weaknesses. The Linear Predictor in the pulse train coder, for example, is
inefficient in describing a tonal audio segment, but the sinusoidal coder can do this
efficiently. Thus, for tonal items like harpsichord, the pulse train coder is unable to deliver
transparent quality for a coarse quantisation of the residual. For other signals, the prediction
order of the pulse train coder linear prediction stage has to be very high to allow a coarse
quantisation of the residual. For noise-like signals, decimation of the residual signal is a
problem and leads to a loss of brightness.

In the preferred embodiment, the coding strategies are combined to form a
base layer using the parametric coder and an additional (bit rate controlled) pulse train layer.
The bit rate resources required for the combined techniques are less than the bit rate
requirements per technique since both methods apply spectral flattening and, consequently,
the bits needed for this stage only have to be invested once. With the preferred embodiment,
a bit rate range from 20-120 kbit/s (for stereo signals) can be covered with performance
better than or comparable with that of state-of-the-art coders.

BRIEF DESCRIPTION OF THE DRAWINGS

Embodiments of the invention will now be described, by way of example, with
reference to the accompanying drawings, in which:

Figure 1 shows a conventional parametric coder;

Figures 2(a) and (b) show a conventional parametric noise encoder (NE) and
corresponding noise decoder (ND) respectively;

Figure 3 shows an overview of a mono encoder according to a preferred
embodiment of the present invention;

Figure 4 shows an overview of a mono decoder according to a first
embodiment of the present invention; and

Figure 5 shows an overview of a mono decoder according to a second
embodiment of the present invention.



DESCRIPTION OF THE PREFERRED EMBODIMENTS 

In the preferred embodiment, a parametric audio coder of the type shown in
Figure 1 is supplemented with a pulse train coder of the type described in P. Kroon, E.F.
Deprettere and R.J. Sluijter, "Regular Pulse Excitation - A novel approach to effective and
efficient multipulse coding of speech", IEEE Trans. Acoust. Speech, Signal Process, 34,
1986. Nonetheless, it will be seen that while the embodiment is described in terms of a
Regular Pulse Excitation (RPE) coder, the invention can equally be implemented with Multi-
Pulse Excitation (MPE) techniques as disclosed in US Patent No. 4,932,061 or an ACELP
coder as described in K. Jarvinen, J. Vainio, P. Kapanen, T. Honkanen, P. Haavisto, R. Salami,
C. Laflamme, J-P. Adoul, "GSM enhanced full rate speech codec", Proc. ICASSP-97,
Munich (Germany), 21-24 April 1997, Volume 2, pp. 771-774, each of which include a first
LP based spectrally flattening stage.

In the preferred embodiment, an overall bit rate budget, determined according
to the quality required from the coder, is divided into a bit rate B usable by the parametric
coder and an RPE coding budget which is inversely proportional to an RPE decimation factor
D.
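
The relation between the RPE budget and the decimation factor D can be illustrated with the following sketch. It assumes, purely for illustration, that each ternary pulse costs log2(3) bits, that offset and gain side information is ignored, and that D is limited to 8. None of these figures is taken from the embodiment, and the function names are hypothetical.

```python
import math

def rpe_bit_rate(sample_rate_hz, decimation, bits_per_pulse=math.log2(3)):
    """Approximate pulse bit rate in bit/s: one pulse every `decimation` samples.

    bits_per_pulse defaults to log2(3) for ternary pulses (an assumption).
    """
    return sample_rate_hz / decimation * bits_per_pulse

def choose_decimation(total_bps, parametric_bps, sample_rate_hz, max_d=8):
    """Smallest D whose pulse rate fits the budget left after the parametric
    bit rate B, or None if the pulse train layer cannot be afforded."""
    rpe_budget = total_bps - parametric_bps
    for d in range(1, max_d + 1):
        if rpe_bit_rate(sample_rate_hz, d) <= rpe_budget:
            return d
    return None
```

Under these assumptions, a total budget of 60 kbit/s with B = 20 kbit/s at 44.1 kHz leads to D = 2, whereas a total budget of 24 kbit/s leaves too little for any pulse train layer.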

Referring now to Figure 3, an input audio signal x is first processed within
block TSA (Transient and Sinusoidal Analysis) corresponding with blocks 11 and 13 of the
parametric coder of Figure 1. Thus, this block generates the associated parameters for
transients and sinusoids as described in Figure 1. Given the bit rate B, a block BRC (Bit Rate
Control) preferably limits the number of sinusoids and preferably preserves transients such
that the overall bit rate for sinusoids and transients is at most equal to B, typically set at
around 20 kbit/s.
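
One possible behaviour of the block BRC is sketched below. The sketch assumes a fixed cost per sinusoid and ranks sinusoids by amplitude. Both are simplifying assumptions, since a practical coder would more likely rank by perceptual relevance, and the names used are hypothetical.

```python
def limit_sinusoids(sinusoids, transient_bits, budget_bits, bits_per_sinusoid):
    """Preserve the transients, then keep the strongest sinusoids that fit.

    sinusoids: list of dicts with an "amplitude" entry (assumed layout).
    """
    remaining = budget_bits - transient_bits
    max_count = max(0, remaining // bits_per_sinusoid)
    ranked = sorted(sinusoids, key=lambda s: s["amplitude"], reverse=True)
    return ranked[:max_count]
```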

A waveform is generated by block TSS (Transient and Sinusoidal Synthesiser)
corresponding to blocks 112 and 131 of Figure 1 using the transient and sinusoidal
parameters (Ct and Cs) generated by block TSA and modified by the block BRC. This signal
is subtracted from input signal x, resulting in signal r1 corresponding to residual x3 in Figure
1. In general, signal r1 does not contain sinusoids and transients.

From signal r1, the spectral envelope is estimated and removed in the block
(SE) using a Linear Prediction or a Laguerre filter as in the prior art Figure 2(a). The
prediction coefficients Ps of the chosen filter are written to a bitstream AS for transmittal to a
decoder as part of the conventional type noise codes Cn. Then the temporal envelope is
removed in the block (TE) generating, for example, Line Spectral Pairs (LSP) or Line
Spectral Frequencies (LSF) coefficients together with a gain, again as described in the prior
art Figure 2(a). In any case, the resulting coefficients Pt from the temporal flattening are
written to the bitstream AS for transmittal to the decoder as part of the conventional type
noise codes Cn. Typically, the coefficients Ps and Pt require a bit rate budget of 4-5 kbit/s.

Because pulse train coders employ a first spectral flattening stage, the RPE
coder can be selectively applied on the spectrally flattened signal r2 produced by the block SE
according to whether a bit rate budget has been allocated to the RPE coder. In an alternative
embodiment, indicated by the dashed line, the RPE coder is applied to the spectrally and
temporally flattened signal r3 produced by the block TE.

As is known from the documents referred to in the background, the RPE coder
performs a search in an analysis-by-synthesis manner on the residual signal r2/r3. Given a
decimation factor D, the RPE search procedure results in an offset (value between 0 and D-
1), the amplitudes of the RPE pulses (for example, ternary pulses with values -1, 0 and 1) and
a gain parameter. This information is stored in a layer L0 included in the audio stream AS for
transmittal to the decoder by a multiplexer (MUX) when RPE coding is employed.
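
A much simplified form of the RPE search is sketched below. It matches the pulse train directly to the residual, without the perceptual weighting filter of a full analysis-by-synthesis coder. The quantisation threshold of half the peak value and all names are assumptions made for illustration.

```python
def rpe_encode(residual, decimation):
    """Try every grid offset 0..D-1, ternary-quantise the samples on that grid,
    and keep the offset whose scaled pulse train is closest to the residual."""
    total_energy = sum(v * v for v in residual)
    best = None
    for offset in range(decimation):
        grid = residual[offset::decimation]
        peak = max((abs(v) for v in grid), default=0.0)
        # Ternary pulses: 0 below the (assumed) threshold, otherwise the sign.
        pulses = [0 if abs(v) <= 0.5 * peak else (1 if v > 0 else -1) for v in grid]
        num = sum(p * v for p, v in zip(pulses, grid))
        den = sum(p * p for p in pulses)
        gain = num / den if den else 0.0   # least-squares optimal gain
        error = total_energy - gain * num  # squared error for that gain
        if best is None or error < best["error"]:
            best = {"offset": offset, "pulses": pulses, "gain": gain, "error": error}
    return best

def rpe_decode(params, decimation, length):
    """Rebuild the pulse train from offset, ternary pulses and gain."""
    out = [0.0] * length
    for i, p in enumerate(params["pulses"]):
        out[params["offset"] + i * decimation] = params["gain"] * p
    return out
```

The dictionary returned by `rpe_encode` contains the three items named above (offset, pulse amplitudes and gain), which in the embodiment would be carried in the pulse train layer.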

Typically, the RPE coder requires a bit rate of at least 40 kbit/s or so and is
therefore switched on as the quality requirement, and so the bit budget of the encoder, is increased
towards the higher end of the quality range. For the lower part of the quality range where the
RPE coder is initially employed, the bit rate B is decreased to less than the maximum bit rate
allowed for when the parametric coder is employed alone. This enables a monotonically
increasing overall bit rate budget range to be specified for the coder with quality increasing in
proportion to the budget.

Experiments showed that the RPE coder results in a loss in brightness in the
reconstructed signal, especially when using high decimation factors (e.g. D=8). Adding some
low-level noise to the RPE sequence mitigates this problem. In order to determine the level of
the noise, a gain (g) is calculated on the basis of, for example, the energy/power difference
between a signal generated from the coded RPE sequence and residual signal r2/r3. This gain
is also transmitted to the decoder as part of the layer L0 information.
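
One way of deriving such a gain from a power difference is sketched below. It assumes that the decoder scales unit-variance white noise by g, so that g squared equals the power missing from the RPE reconstruction. This interpretation and the function name are assumptions.

```python
import math

def noise_gain(residual, rpe_signal):
    """Gain g for unit-variance noise that fills the power gap between the
    residual and the signal rebuilt from the coded RPE sequence."""
    n = len(residual)
    p_res = sum(v * v for v in residual) / n
    p_rpe = sum(v * v for v in rpe_signal) / n
    # Clamp at zero in case the RPE signal carries more power than the residual.
    return math.sqrt(max(0.0, p_res - p_rpe))
```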

Referring now to Figure 4, a first embodiment of the decoder compatible with
the embodiment of Figure 3 where the RPE block processes the residual signal r2 is shown. A
de-multiplexer (DeM) reads an incoming audio stream AS' and provides the sinusoidal,
transient and noise codes (Cs, Ct and Cn(Ps,Pt)) to respective synthesizers SiS, TrS and
TEG/SEG as in the prior art. As in the prior art, a white noise generator (WNG) supplies an
input signal for the temporal envelope generator TEG. In the embodiment, where the
information is available, a pulse train generator (PTG) generates a pulse train from layer L0
and this is mixed in block Mx to provide an excitation signal r2'. It will be seen from the
encoder that, as the noise codes Cn(Ps,Pt) and layer L0 were generated independently from
the same residual r2, the signals they generate need to be gain modified to provide the correct
energy level for the synthesized excitation signal r2'. In this embodiment, in a mixer (Mx),
the signals produced by the blocks TEG and PTG are frequency weighted, so that for low
frequencies, most of the signal r2' is derived from the pulse coded information L0 and for
high frequencies most of the signal r2' is derived from the synthesized noise source
WNG/TEG.
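
The frequency weighting in the mixer can be illustrated with a complementary pair of first-order filters. The one-pole low-pass filter, its coefficient alpha and the function names are assumptions chosen for brevity, and the two inputs are assumed to have been gain matched already.

```python
def one_pole_lowpass(x, alpha):
    """First-order recursive low-pass filter with smoothing factor alpha."""
    y, state = [], 0.0
    for v in x:
        state += alpha * (v - state)
        y.append(state)
    return y

def mix_excitation(pulse_train, shaped_noise, alpha=0.3):
    """Low frequencies from the pulse train, high frequencies from the noise."""
    low = one_pole_lowpass(pulse_train, alpha)
    noise_low = one_pole_lowpass(shaped_noise, alpha)
    # Complementary high-pass: the noise minus its own low-pass part.
    high = [n - l for n, l in zip(shaped_noise, noise_low)]
    return [l + h for l, h in zip(low, high)]
```

With constant inputs the output settles on the pulse train value, since the constant part of the noise is removed, while noise alternating at the highest frequency passes almost unattenuated.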

The excitation signal r2' is then fed to a spectral envelope generator (SEG)
which according to the codes Ps produces a synthesized noise signal r1'. This signal is added
to the synthesized signals produced by the conventional transient and sinusoidal synthesizers
to produce the output signal x̂.

In an alternative embodiment, the signal generated by the pulse train generator
PTG is used instead of the signal generated by WNG as an input to the temporal envelope
generator as indicated by the dashed line.

Referring now to Figure 5, a second embodiment of the decoder corresponds
with the embodiment of Figure 3 where the RPE block processes the residual signal r3. Here,
the signal generated by a white noise generator (WNG) and processed by a block We, based
on the gain (g) determined by the coder, and the pulse train generated by the pulse train
generator (PTG) are added to construct an excitation signal r3'. Where layer L0 information is
available, within block We, the noise sequence is high-pass filtered to remove the low
frequencies, which perceptually degrade the reconstructed excitation signal. As in the first
embodiment of the decoder, these components of the synthesized noise signal are based on
the output of the pulse train generator rather than the noise based excitation signal. Of course,
where layer L0 information is not available, the white noise is fed through the block We to be
provided as the excitation signal r3' to a temporal envelope generator block (TEG).

The temporal envelope coefficients (Pt) are then imposed on the excitation
signal r3' by the block TEG to provide the synthesized signal r2' which is processed as before.
As mentioned above, this is advantageous because a pulse train excitation typically gives rise
to some loss in brightness which, with a properly weighted additional noise sequence, can be
counteracted. The weighting can comprise simple amplitude or spectral shaping each based
on the gain factor g.

As before, the signal is filtered by, for example, a Laguerre filter in block SEG 
(Spectral Envelope Generator), which adds a spectral envelope to the signal. The resulting 
signal is then added to the synthesized sinusoidal and transient signal as before. 

It will be seen that in either Fig 4 or Fig 5, if no PTG is being used, the
decoding scheme resembles the conventional sinusoidal coder using a noise coder only. If
the PTG is used, an RPE sequence is added, which enhances the reconstructed signal, i.e.
provides a higher audio quality.

It should be noted that in the embodiment of Figure 5, in contrast to the
standard pulse coder (RPE or MPE), where a gain which is fixed for a complete frame is
used, a temporal envelope is incorporated in the signal r2'. By using such a temporal
envelope, a better sound quality can be obtained, because of the higher flexibility in the gain
profile compared to a fixed gain per frame.

1. A method of encoding an audio signal (x), the method comprising, for each of
a plurality of segments of the signal, the steps of:

analysing (TSA) the sampled signal values to provide one or more sinusoidal
codes (Cs) corresponding to respective sinusoidal components of the audio signal;

subtracting a signal corresponding to said sinusoidal components from said
audio signal to provide a first residual signal (r1);

modelling (SE) the frequency spectrum of the first residual signal (r1) by
determining first filter parameters (Ps) of a filter which has a frequency response
approximating a frequency spectrum of the first residual signal;

subtracting a signal corresponding to said first filter parameters from the first
residual signal to provide a second residual signal (r2);

modelling (RPE) a component (r2,r3) of the second residual signal with a pulse
train coder (RPE) to provide respective pulse train parameters (L0); and

generating (15) an encoded audio stream (AS) including said sinusoidal codes
(Cs), said first filter parameters (Ps) and said pulse train parameters (L0).



2. A method as claimed in claim 1 further comprising the steps of:

modelling (TE) the temporal envelope of each second residual signal by
determining second parameters (Pt), and

providing a third residual signal (r3) by removing from the second residual
signal the temporal envelope corresponding to said second parameters;

wherein said component of the second residual signal comprises a respective
third residual signal and

wherein said generating step includes said second parameters in said encoded
audio stream (AS).



3. A method as claimed in claim 1 further comprising the step of:

modelling (TEG) the temporal envelope of the second residual signal by
determining second parameters (Pt), and

wherein said component of each second residual signal comprises said second
residual signal (r2); and

wherein said generating step includes said second parameters in said encoded
audio stream (AS).

4. A method as claimed in claims 2 or 3 further comprising the step of:

estimating a difference between a signal corresponding to said pulse train
parameters and said component (r2,r3) of each second residual signal; and

wherein said generating step includes an indicator of said difference (g) in said
encoded audio stream (AS).

5. A method as claimed in claim 1 wherein said pulse train coder is one of a
regular pulse excitation (RPE) coder; a multiple-pulse excitation (MPE) coder; or an ACELP
coder.

6. A method as claimed in claim 1 wherein said first filter parameters (Ps)
comprise one of: Laguerre or Linear Prediction filter parameters.

7. A method as claimed in claim 2 or 3 wherein said second parameters (Pt)
comprise one of: Linear Prediction parameters or Line Spectral Pairs (LSP) or Line Spectral
Frequencies (LSF) coefficients together with respective gains.

8. A method as claimed in claim 1 wherein said method comprises the step of:

estimating (TSA) a position of a transient signal component in the audio
signal;

matching a shape function having shape parameters and a position parameter
to said transient signal; and

including (15) the position and shape parameters describing the shape function
in said audio stream (AS).

9. A method as claimed in claim 1 wherein the number of said sinusoidal
components is limited by a first bit rate budget (B), wherein said pulse train coder is limited
to producing said pulse train parameters (L0) within a second bit rate budget, and wherein the
sum of said first and second bit rate budgets is selected from a range according to a required
quality of encoding.

10. Method of decoding an audio stream, the method comprising the steps of:

reading (DeM) an encoded audio stream (AS') including, for each of a
plurality of segments of an audio signal: sinusoidal codes (Cs), pulse train parameters (L0),
and first filter parameters (Ps); and

employing (SiS) said sinusoidal codes to synthesize respective sinusoidal
components of the audio signal;

employing (PTG) said pulse train parameters (L0) to generate an excitation
signal;

imposing (SEG) a spectral envelope according to said first filter parameters
(Ps) on a first signal (r2') a component of which comprises said excitation signal, and

adding said synthesized sinusoidal components and said spectrally filtered
signal to produce a synthesized audio signal (x̂).

11. A method according to claim 10 wherein said encoded audio stream includes
second parameters (Pt), said method comprising the step of:

imposing (TEG) a temporal envelope according to said second filter
parameters (Pt) on a second signal (r3') a component of which comprises said excitation
signal, and

wherein said first signal comprises said temporally filtered signal (r2').

12. A method according to claim 11 further comprising the steps of:

generating (WNG) a white noise signal; and

adding said white noise signal to said excitation signal to provide said second
signal (r3').

13. A method according to claim 12 further comprising:

high-pass filtering (We) said white noise signal.

14. A method according to claim 12 wherein a gain (g) to be applied to said white
noise signal is read from said audio stream.

15. A method according to claim 10 wherein said encoded audio stream includes
second filter parameters (Pt), the method comprising the step of:

imposing (TEG) a time domain envelope according to said second filter
parameters (Pt) on said excitation signal, and

wherein said spectral envelope is imposed on said temporally filtered signal.

16. A method according to claim 10 wherein said encoded audio stream includes
second filter parameters (Pt), the method comprising the steps of:

generating (WNG) a white noise signal;

imposing (TEG) a time domain envelope according to said second filter
parameters (Pt) on the white noise signal, and

mixing said temporally filtered white noise signal with said excitation signal to
provide said second signal (r2');

wherein said spectral envelope is imposed on said second signal (r2').

17. A method according to claim 16 wherein said mixing step comprises spectrally
weighting said temporally filtered white noise signal and said excitation signal.

18. Audio coder arranged to process a respective set of sampled signal values for
each of a plurality of sequential segments of an audio signal (x), said coder comprising:

an analyser (TSA) arranged to analyse the sampled signal values to provide
one or more sinusoidal codes (Cs) corresponding to respective sinusoidal components of the
audio signal;

a subtractor arranged to subtract a signal corresponding to said sinusoidal
components from said audio signal to provide a first residual signal (r1);

a modeller (SEG) arranged to model the frequency spectrum of the first
residual signal (r1) by determining first filter parameters (Ps) of a filter which has a frequency
response approximating a frequency spectrum of the first residual signal;

a subtractor arranged to subtract a signal corresponding to said first filter
parameters from the first residual signal to provide a second residual signal (r2);

a modeller (RPE) arranged to model a component (r2,r3) of the second residual
signal with a pulse train coder (RPE) to provide respective pulse train parameters (L0); and

a bit stream generator (15) for generating an encoded audio stream (AS)
including said sinusoidal codes (Cs), said first filter parameters (Ps) and said pulse train
parameters (L0).

19. Audio player, comprising:

means for reading (DeM) an encoded audio stream (AS') including, for each
of a plurality of segments of an audio signal: sinusoidal codes (Cs), pulse train parameters
(L0), and first filter parameters (Ps); and

a synthesizer (SiS) arranged to employ said sinusoidal codes to synthesize
respective sinusoidal components of the audio signal;

means (PTG) for generating an excitation signal from said pulse train
parameters (L0);

means for imposing (SEG) a spectral envelope according to said first filter
parameters (Ps) on a first signal (r2') a component of which comprises said excitation signal,
and

an adder for adding said synthesized sinusoidal components and said
spectrally filtered signal to produce a synthesized audio signal (x̂).

20. Audio system comprising an audio coder as claimed in claim 18 and an audio
player as claimed in claim 19.

21. Audio stream (AS) comprising sinusoidal codes (Cs) corresponding to
respective sinusoidal components of an audio signal (x); first filter parameters (Ps) for a filter
which has a frequency response approximating a frequency spectrum of a first residual
signal, said first residual signal corresponding to said audio signal with a signal
corresponding to said sinusoidal components subtracted; and pulse train parameters (L0)
modelled from a component (r2,r3) of a second residual signal, said second residual signal
corresponding to said first residual signal with a signal corresponding to said first filter
parameters subtracted.

22. Storage medium on which an audio stream (AS) as claimed in claim 21 has
been stored.



ABSTRACT:



An audio coder is arranged to process a respective set of sampled signal values
for each of a plurality of sequential segments of an audio signal (x). The coder comprises an
analyser (TSA) arranged to analyse the sampled signal values to provide one or more
sinusoidal codes (Cs) corresponding to respective sinusoidal components of the audio signal.
A subtractor subtracts a signal corresponding to the sinusoidal components from the audio
signal to provide a first residual signal (r1). A modeller (SEG) models the frequency spectrum
of the first residual signal (r1) by determining first filter parameters (Ps) of a filter which has
a frequency response approximating a frequency spectrum of the first residual signal.
Another subtractor subtracts a signal corresponding to the first filter parameters from the first
residual signal to provide a second residual signal (r2). Another modeller (RPE) models a
component (r2,r3) of the second residual signal with a pulse train coder (RPE) to provide
respective pulse train parameters (L0). A bit stream generator (15) generates an encoded
audio stream (AS) including the sinusoidal codes (Cs), the first filter parameters (Ps) and the
pulse train parameters (L0).



Fig. 3 